To Memorize or to Predict: Prominence Labeling in Conversational Speech

Authors

  • Ani Nenkova
  • Jason M. Brenier
  • Anubha Kothari
  • Sasha Calhoun
  • Laura Whitton
  • David Beaver
  • Daniel Jurafsky
Abstract

The immense prosodic variation of natural conversational speech makes it challenging to predict which words are prosodically prominent in this genre. In this paper, we examine a new feature, accent ratio, which captures how likely it is that a word will be realized as prominent or not. We compare this feature with traditional accent-prediction features (based on part of speech and N-grams) as well as with several linguistically motivated and manually labeled information structure features, such as whether a word is given, new, or contrastive. Our results show that the linguistic features do not lead to significant improvements, while accent ratio alone can yield prediction performance almost as good as the combination of any other subset of features. Moreover, this feature is useful even across genres; an accent-ratio classifier trained only on conversational speech predicts prominence with high accuracy in broadcast news. Our results suggest that carefully chosen lexicalized features can outperform less fine-grained features.

Disciplines: Computer Sciences

Comments: Nenkova, A., Brenier, J., Kothari, A., Calhoun, S., Whitton, L., Beaver, D., & Jurafsky, D. To Memorize or to Predict: Prominence Labeling in Conversational Speech. Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, April 2007. http://www.aclweb.org/anthology/N07-1002

This conference paper is available at ScholarlyCommons: http://repository.upenn.edu/cis_papers/732

Affiliations: Stanford University ({anenkova,jbrenier,anubha,lwhitton,dib,jurafsky}@stanford.edu); †University of Edinburgh
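The abstract describes accent ratio only informally, as how likely a word is to be realized as prominent. As a rough illustration, the sketch below estimates that likelihood as the fraction of a word's training occurrences that were accented. This is an assumption, not the paper's exact definition: the function names, the toy data, the 0.5 fallback for unseen words, and the 0.5 decision threshold are all hypothetical.

```python
from collections import Counter


def accent_ratios(tokens):
    """Estimate a per-word accent ratio from (word, is_accented) pairs.

    Assumption: the ratio is simply accented occurrences / all occurrences.
    The abstract does not give the paper's exact formula.
    """
    total = Counter()
    accented = Counter()
    for word, is_accented in tokens:
        key = word.lower()
        total[key] += 1
        if is_accented:
            accented[key] += 1
    return {word: accented[word] / total[word] for word in total}


def predict_prominent(word, ratios, default=0.5, threshold=0.5):
    """Predict prominence from the stored ratio alone.

    Assumption: unseen words fall back to `default`, and a word is
    predicted prominent when its ratio is at least `threshold`.
    """
    return ratios.get(word.lower(), default) >= threshold


# Toy training data (hypothetical, not from the paper's corpus).
training = [
    ("the", False), ("the", False), ("the", True),
    ("really", True), ("really", True),
]
ratios = accent_ratios(training)
```

Because the feature is a lookup table keyed on the word itself, the same table can in principle be applied to text from another genre, which is the cross-genre use the abstract mentions.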

Similar resources

Relative Importance in English and Persian: Thematization or Tonic Prominence?

There are two common ways to assign relative importance in spoken language: tonic prominence and thematization. The former is expressing the main points of information units in speech (Halliday, 1994), and the latter is putting an element at the beginning of a clause. This study explores how relative importance is realized in English and Persian. It also investigates how advanced Persian learne...

Detecting Prominence in Conversational Speech: Pitch Accent, Givenness and Focus

The variability and reduction that are characteristic of talking in natural interaction make it very difficult to detect prominence in conversational speech. In this paper, we present analytic studies and automatic detection results for pitch accent, as well as on the realization of information structure phenomena like givenness and focus. For pitch accent, our conditional random field model co...

Ethnomethodology and Conversational Analysis

In a speech community, people utilize their communicative competence which they have acquired from their society as part of their distinctive sociolinguistic identity. They negotiate and share meanings, because they have commonsense knowledge about the world, and have universal practical reasoning. Their commonsense knowledge is embodied in their language. Thus, not only does social life depend...

Automatic prominence annotation of a German speech synthesis corpus: towards prominence-based prosody generation for unit selection synthesis

This paper describes work directed towards the development of a syllable prominence-based prosody generation functionality for a German unit selection speech synthesis system. A general concept for syllable prominence-based prosody generation in unit selection synthesis is proposed. As a first step towards its implementation, an automated syllable prominence annotation procedure based on acoust...

Using Conditional Random Fields to Predict Pitch Accents in Conversational Speech

The detection of prosodic characteristics is an important aspect of both speech synthesis and speech recognition. Correct placement of pitch accents aids in more natural sounding speech, while automatic detection of accents can contribute to better word-level recognition and better textual understanding. In this paper we investigate probabilistic, contextual, and phonological factors that influe...

Publication date: 2007